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1 Introduction 



This course is meant as introduction to what is widely considered to be the most beau- 
tiful and imaginative physical theory ever devised: General Relativity. 1 It is well-known 
as the brain-child of Einstein who developed it most or less single-handedly in the years 
leading up to its publication in 1915. It seems that everyone knows something about it. 
Perhaps most impressively the theory was developed without any experimental input 
(although it immediately explained at least one known observational anomaly). Rather 
Einstein made 'thought experiments' 2 which led him to the notion of the equivalence 
principle and then the hypothesis that spacetime is curved. From here one simply adapts 
Newton's famous F = ma law to predict Newton's gravitational law F = —G^mYm^lr 2 
along with corrections. In this way the motion of falling apples, astronauts, moons, 
planets and stars are simply tracing the natural 'straight lines' of a curved spacetime. 
Furthermore it opened up the entire Universe and its history to study in the form of 
Cosmology which is now a highly developed scientific field. 

The plan of the course is as follows. We will first review Special Relativity. It is 
assumed that you have a reasonable knowledge of this as well as tensors such as those 
encountered in Electromagnetism (e.g. F^ u ). We will review (or perhaps introduce) co- 
variant index notation and spacetime. Next we will discuss Einstein's famous thought 
experiment and follow his logic that leads to the notion that, in the presence of gravity, 
spacetime is curved. We will then spend some time developing the mathematics needed 
to understand curved spaces. These are known as manifolds in the mathematical litera- 
ture although we will try to avoid using the abstract mathematical machinery. We will 
then have to spend some getting used to tensors in curved spacetime. This includes the 
all important notions of covariant derivative and curvature. 

We can then write down Einstein's theory. For the rest of the course we will examine 
two classic solutions. The first is the Schwarzchild solution that models the curvature 
about a spherical mass. Here we can derive Newton's gravitational law as the leading 
order effect. However we will see that there are corrections which lead to new predictions. 
The first is the so called perihelion shift. This is where the planets depart slightly 
from a pure elliptical orbit about the sun. This explained observational anomaly of 
Mercury that was known already since the 1800's (although a triumph in retrospect at 
the time it was not clear that there wasn't another explanation, such as a new planet). 
Secondly we will see that light is bent as it passes by the sun. This was a signature 
prediction of General Relativity and was confirmed in 1919 thereby establishing the 
theory and confirming the picture of spacetime as curved. The second solution that 
we will discuss is the Freedman-Robertson- Walker (FRW) metric which describes the 
cosmological structure of the entire Universe, predicting that it started with a Big Bang. 

Thats quite a lot for one idea! 

1 WARNING: this course can make you want to do theoretical physics for the rest of your life. 
2 Unfortunately this has inspired generations of crackpots ever since. 
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2 Special Relativity 



Let us start by a review of Special Relativity. To do physics one introduces a coordinate 
system typically x, y, z and t. Using such a coordinate system one can measure positions 
and velocities, make predictions and test them. That is physics. Here t of course 
represents time and x, y, z space. Before Einstein it was thought that time was absolute. 
And space was absolute. That meant that everyone surely agreed on the passage of time. 
And space was, in some sense, a solid object that we all play around in. Of course the 
coordinate systems that people use need not agree. However there are well defined rules 
that tell you how to convert quantities in one coordinate system (sometimes called a 
frame) to those of another. But time was always just time and measured in seconds, 
minutes, hours, days,.... and we surely all agree on the absolute passage of time no?... 

A fundamental idea of special relativity is that the law of physics do not depend on 
any particular choice of frame. This is so much a part of modern physics that its hard 
to think otherwise. It is formulated in the so-called principle of relativity. To quote 
Einstein himself: 

• Principle of Relativity: If a system of coordinates K is chosen so that, in relation 
to it, physical laws hold good in their simplest form, the same laws hold good 
in relation to any other system of coordinates K' moving in uniform translation 
relatively to K. 

Here moving in uniform translation relatively to K means that it moves with a constant 
velocity with respect to K. In the modern parlance we'd say that the laws of physics are 
covariant with respect to coordinate transformations. 

One important feature of the principle of relativity is that there is no preferred frame 
that the universe uses. Although it is now almost forgotten the prevailing view in the 
19th century was that there must be some medium throughout space and time, called 
the ether, where electromagnetic waves 'wave' in. This then defines a preferred reference 
frame. However it is well-known that the famous Michelson-Morley experiment failed 
to detect any motion of the earth through the ether. Einstein's theory simply did away 
with the ether. 

Apart from this the Principle of Relativity alone is not so new and the older theory 
theories of Newton and others satisfy a form of relativity known as Galilean relativity. 
In Galilean relativity if the coordinates of K are (x,y,z,t) and those of K' (x', y', z', t') 
and K' is moving along the x-axis with speed v then the relation is 

t' = t 

x' = x + vt (2.1) 

y' = y 

z = z 

There are also other possible transformations, for example the spatial coordinates (x 1 , y', z') 
could be related to those of K by a rotation. But t' = t assuming you measure time in 
seconds in both frames! 
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Note that with this change of coordinates we can compute the 'addition' rule of 
velocities. Suppose in frame K a particle is moving with speed u along the x-axis. If at 
ti — it is at Xi — then at time t<i it will be at £2 = ^2- In frame K' we find that it 
is still at x[ = when t[ = but now at time t 2 = t 2 it is at x' 2 = ut 2 + vt 2 . Thus in 
the frame K' one observes the speed 

u' = x' 2 /t' 2 = U + V 

which of course is the familiar rule for addition of velocities. 

The second idea is more radical and defines Special Relativity: 

• Principle of a constant speed of light: The speed of light as measured in any two 
frames that are move with constant velocity respect to each other is the same 
universal constant called c 

What made Einstein think of this? Well the answer is that Maxwell's equations of 
electromagnetism (developed while Maxwell was at King's) have a factor of c~ l on the 
right-hand-side. Thus they don't make any sense in a frame where c = 0. Thus they 
can't be Galilean invariant. 

Clearly this violates the usual rules of changing frames where one would simply adds 
the velocities. This leads to a new set of rules about how to transform between K and 
K'. In fact Maxwell's equations were known to be invariant under a set of coordinate 
transformations known Lorentz transformations which predates Einstein. However Ein- 
stein saw that Maxwell's equations and Lorentz transformations were fundamental (thus 
in a sense King's College is the birth place of Relativity). In particular for a frame K' 
moving along the x-axis of a frame K the Lorentz transformations are 

x' = 7(2; + vt) 

t' = 7 (t + ra /c 2 ) (2.2) 

y = y z' = z 

where 7 = 1/ ^Jl — v 2 /c 2 . 

Problem: Show that the addition law for velocities is 

, u + v 
1 + uv/c z 

Indeed one sees from this that d = c and also v! < c provided that u,v < c. Thus 
one can never go faster than the speed of light by switching from one frame to another 
(assuming no frame is moving faster than the speed of light). This transformation also 
leads to the well-known effects of time-dilation and length contraction. We will not 
review these calculations here as they appear in any course on Special Relativity. 

Soon afterwards a deeper geometrical structure was shown by Minkowski to underlie 
Special Relativity. In particular one sees from the Lorentz transformations that t' 7^ t 
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and furthermore t' is mixed into t and x. Thus time is no longer separate and distinct 
from space. Indeed given the principle of relativity one cannot even pick frame with a 
preferred or absolute notion of time. 

Rather Minkowski introduced spacetime which consists of all four coordinates to- 
gether: (t,x,y,z). It is helpful (on dimensional grounds) to introduce the coordinates 



along with x 1 = x, x 2 = y and x 3 = z. Now three-dimensional Euclidean space can be 
thought of as R 3 where the rule for calculating lengths is 

A/ 2 = (Ax) 2 + (Ay) 2 + (Az) 2 

It is important to note that length is a 'scalar' quantity, meaning that in any admissible 
frame all observers will agree on the length of an object. Minkowski generalized this to 
spacetime with 

As 2 = -c 2 (At) 2 + (Ax) 2 + (Ay) 2 + (Az) 2 = t^A^As" 

where 
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The key idea now is that As is a scalar quantity that all observers will agree on. Note 
the minus sign. Although time can be, in some sense, be rotated into space, it is also 
clearly different. 

An important part of Minkowski spacetime is that (As) 2 can be negative, positive 
or zero. Vectors with such lengths are called timelike, spacelike or null respectively In 
particular the set of vectors with null length defines a surface known as the lightcone 
centred at the origin. Thus spacetime is split into two distinct areas that are separated 
by the lightcone. 

The point is that signals can only travel at or below the speed of light and thus must 
be timelike. This means that an event located at the origin can only be seen in the 
t > timelike (interior of the light cone) region. And conversely only events in the t < 
timelike (interior of the lightcone) can be seen by an observer at the origin. The rest of 
Minksowski space (the spacelike region outside the lightcone) cannot communicate with 
the origin. It is said to not be in causal contact with the origin whereas the interior 
region is in causal contact. 

In ordinary three-dimensional space observers do not agree on the length of an object 
along along a given axis, such as the x-axis, because they disagree on what they've called 
the x-axis. For example if you take a rod and rotate it you can change its 'length' along 
any given axis that is, for example defined by the walls of the room you are in. The 
overall length, as measured by Al is of course invariant but the projection of the length 
onto a particular axis can be changed by a choice of frame. 
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This all happens in Minkowski space too. The only difference is now time is another 
coordinate that can be 'rotated' in to the spatial directions. Thus the projection of the 
length of a rod onto a spatial or temporal axis can change. This is the origin of length 
contraction and time dilation. However there are invariant notions of length defined by 
As. 

From this perspective the Lorentz transformations arise from requiring that As is 
the same in all frames. Thus if x ,fM = A^ v x v then we must have that 

(As') 2 = ^Ax^Ax'" 

= t^A^A^A^Ax" (2.3) 

= (As) 2 

= r) ap Ax a Ax p 

Here we have used Einstein's summation convention whereby a repeat upstairs and 
downstairs index is summed over. Thus we must have, since Ax^ is arbitrary, 

Vo-p ^//t'A^o-A p . 

If we think in term of matrices then this equation is simply rj = A T r)A where we have 
used matrix multiplication. 

More generally a tensor, such as the electromagnetic field strength, transforms in the 
same, covariant, way: F' Xa = F flu A fl a A u p . Thus in a sense the rule Special Relativity is 
that the metric tensor is invariant under Lorentz transformations (however other tensors 
are not). 

Problem: Consider a transformation of the form 

'° 0,7,1 

x = ax + ox 

x' 1 = cx^ + dx 1 (2.4) 

/2 2/3 3 

X = X X = X 

Show that the condition (2) implies that we can parameterize (for simplicity assume 
that a, d > 0) 

a = d = cosh (3 b = c = sinh (3 

for some parameter (3. Using this rederive the Lorentz transformations by writing 
tanh/3 = v/c. 

This was the situation about just before 1915. However it has the problem in that 
it cannot describe acceleration and therefore, according to Newtons laws, any force. 
Although if will work in situations where the acceleration is small and also for electro- 
magnetism which is relativistic. Thus Einstein was led to a more general principle of 
relatively. 

The new principle is motivated by the following thought experiment (slightly mod- 
ernised): You've seen pictures of the astronauts in the space shuttle. They are weightless 
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but why? The naive answer is that it is because they are in outer space away from the 
pull of earth's gravity. But then if this is the case, why do they say in orbit about the 
earth? This is clearly not the case. Rather what happens is that in Newtons laws one 
has 

G N Mm 



ma 



where M is the mass of the earth, Gat is Newtons constant, r the distance from the 
centre of the earth and m the mass of the astronaut. But m is also the mass of anything 
else in the space shuttle. The point is that m drops out from this equations and one 
simply has 

G N M 



a = 



for all objects, astronauts or otherwise. Thus, in free fall, they all follow the same 
path, i.e. there is a universal path valid for all observers in free fall. This of course is 
essentially just the famous leaning tower of Pisa experiment of Galileo, revisited many 
many years later. Einstein;s genius (amoung other things) was to notice that this is a 
rather strange feature that points to a deeper more fundamental interpretation. 

There is a well-known issue here that I am not going to give much time to. It could 
be argued that the m that appears in F = ma is different from the m that appears in 
F = —G N Mm/r 2 . The former is known as the inertial mass m, whereas the later is 
called the gravitational mass m g . It should clear from our discussion that we are heavily 
relying on the assumption that 

mi = m g 

for all objects. All I can say is: "yes we are". But all evidence, to a high degree of 
accuracy suggests that this is true. So let us say no more about this. 

Conversely if we sit in a car and it accelerates we feel a force, just like the force 
of gravity, only it acts to push us backwards, rather than down. The great realisation 
of Einstein was that a single observer cannot perform any local experiment which can 
tell if s/he is in free fall in a gravitational field or if there is no gravitational force at 
all. Conversely they also cannot tell, using local experiments, the differences between 
acceleration and a gravitational field. 

The important concept here is locally. Obviously a freely falling observer can tell 
that they are in the gravitation field of the earth if they can see the whole earth, along 
with their orbit. Certainly they will know if they ever hit the earth. The point is that 
the laws of physics should be local, i.e. they should only depend on the properties of 
spacetime and fields as evaluated at each single point independently. This leads to the 
'happiest thought of Einsteins life', the famous principle of equivalence 

• Principle of Equivalance: The the laws of physics cannot distinguish between 
motion in a gravitational field and acceleration 

Thus we aim to reconcile the following observation. A particle which is freely falling 
in a gravitational field is physically equivalent to a particle which feels no force. And 
furthermore the paths followed by all free-falling object are the same. Now by Newton's 
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law no force should mean that the particle does not accelerate, that it moves in a 
'straight line'. The key idea here is to realise that spacetime must therefore be curved 
and that 'straight lines' are not actually straight in the familiar sense of the word. The 
space shuttle moves in orbit around the earth because locally, that is from one instant 
to the next, it is following a straight path - the path of shortest length - in a spacetime 
that is curved. Just as an airplane travels in a great circle that passes over Greenland 
whenever it flies from Washington DC to London. 

Mathematically we can impose these ideas by noting that a curved space is described 
by the concept of a manifold. Roughly speaking a manifold is a space that in a neigh- 
bourhood of each point looks like El" for some n. Here we have n = 4. For infinitessimal 
variations dx p the proper distance is 

ds 2 = g pu dx p dx u (2.5) 

where g^ is called the metric and locally determines the geometry of spacetime be 
determining the lengths and angles in an infinitessimal neighbourhood of each point. 
By definition g^ is symmetric: g^ u = g vll . We will review manifolds and their tensors 
in more detail soon. 

The general principle of relativity states that the laws of physics are invariant under 
an arbitrary - but invertable - coordinate transformation (not just linear) 

_> x '» = x '»( x ») (2.6) 
under which we have that ds 2 is invariant. The same calculation as above leads to 

ds' 2 = g'^dx' p dx' v (2.7) 

, dx'» , n dx' v , „ , . 

= g'^dx p —dx u 2.8 
^ v dxP dx a v ; 

= g pa dx p dx a (2.9) 

Note that the transformation needs to be invertable so that the Jacobian 

dx'» 
dxP 



is an invertable 4 matrix whose inverse is 



A " = ~~ — (2.11) 



dx' a 
since 

,„ dx' p dx p , ,,, dx' p dx p 

A dxP dx' x A dx x dx'p v ; 

Such a change of variables is called a diffeomorhism. We now see that the invariance of 
the infinitessimal proper distance implies that 

, dx p dx u 
9p ° = dx^dx^ 9 ^ (2 - 13) 
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This is the defining property of a tensor field and we will discuss these in more detail 
soon. 

Thus we see that we have generalised the linear transformation property x" 1 = h^ v x v 
of Special Relativity to arbitrary, but invertable, transformations by including a general 
metric g^ v (x) rather than a fixed one rj^ (this is essentially the same as gauging in a 
gauge theory). The theory of General Relativity treats the metric g^ v as a dynamical 
object and its evolution is obtained from Einstein's equation. 



3 Elementary geometry 
3.1 IR the hard way 

To understand these ideas we need to get used to metrics and curved spaces. We will 
start with an example that we all know, where there is a non-constant metric, although 
the space is not curved: polar coordinates for IR 2 . The Euclidean rule for lengths is 

ds 2 = dx 2 + dy 2 

So here the metric is (using matrix notation) 

(\ 

However one frequently wants to use so-called polar coordinates: 

x = r cos 9 y = r sin ^ 

What is the metric in these coordinates? Well we can easily compute that 

dx = dr cos 9 — r sin 9d9 dy = dr sin 9 + r cos 9d9 

and thus we see that 

ds 2 = (dr cos 9 — r s'm9d9) 2 + (drsin9 + r cos 9d9) 2 

= dr 2 + r 2 d9 2 . (3.1) 

We now have (again using matrix notation) 

- ( X 
9 ~{0 r 2 

and thus has non-constant components. But we are clearly still just taking about Eu- 
clidean IR 2 ! 

These coordinates are good for somethings, but not everything. Indeed they actually 
suffer a pathology at r = 0. Here 9 is not defined. In particular the inverse map is 



\Jx 2 + y 2 9 = arctan(y/a;) 
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but y/x is ambiguous at r = i.e. x = y = (you might object to x = but it is clear 
that 9 = ±7r/2 there, depending on the sign of y). Thus this coordinate system is not 
'global' meaning that it does not cover all of the space one is interested in. In particular 
it doesn't cover r = 0. However this is sufficiently simple that one can work around it 
and pretend as if all is okay. 

In physics we are often interested in two types of computation. The first is to solve 
differential equations such as 

V 2 $ = J 

where <£> is some field and J is some source. For example in electrostatics Maxwell's 
equation for the time- component of the vector potential is V 2 A = p, where p is the 
charge density. In Cartesian coordinates we have (in IR 2 ) 

nU ( d 2 d 2 \ 

V $ = 1 $ 

V dx 2 dy 2 I 



let us see what this is in polar coordinates. We first note that, by the chain rule, 

d dr d 89 d 

dx dx dr dx d9 

x d y 1 d 

r dr x 2 1 + y 2 /x 2 d9 

= cos 6- sm0 — 

or r o9 
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(3.2) 



Similarly 



d n d i n d 

— = sin0— + -cos6» — 
dy dr r d9 



Next we find 

d 2 ( a i . a \ ( a i d 

cos 9- sm9—\ [cos 9- sm9- 



dx 2 \ dr r 89 J \ dr r d9 J 

= COs2 ^ + ^ sin2 ^ + ^ si ^ COS ^ + ^ Sin2 ^ (3 ' 3) 



and similarly 

dy 2 

Thus we find 




d 2 1 d 2 2 did 

sin 2 9— + — cos 2 0^ 2 -^sm9cos9— + - cos 2 9— (3.4) 
dr 1 r z d9 z r z d9 r dr 



d 2 Id Id 2 
dr 2 r dr r 2 d9 2 



3 As well as the fact that (arctan^;)' = 1/(1 + z 2 ). 
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which is often rewritten as 



Hopefully you've seen that (or maybe the 3D version) before. 

The second thing one needs to do is compute integrals of some quantity over some 
region of space. Suppose that one has some function f(x,y) that we want to integrate 
over some region: 

/ f = J J dxdyf(x,y) 

But let us now do it in polar coordinates. We have already computed dx and dy in 
terms of dr and dO. From calculus you know that to change the integration measure 
you must introduce the jacobian factor 

'd(x,yY 



J'-JJ 



drdd 



det 



/M) 



d(r,9) 

Here d(x,y)/d(r,8) is the matrix formed by taking all derivatives: 

( 9(x,y) \ 
\d(r,9) 




cos 9 — r sin 9 
sin 9 r cos 9 



(3.5) 



Thus we see that 



and so 



det ( d ( x >yy 



// = // drd9rf(r,9) 



The thing to notice here is that r = y/det g (using a matrix notation for the metric). 

Why am I bothering to go through this, what's it got to do with curved space? 
Well although this space is not curved, in these coordinates the Laplacian and volume 
form have non-trivial form due to the non-constant metric. We will recover these forms 
later from so-called covariant derivatives and more general considerations of Riemannian 
geometry. 

Problem: Consider spherical coordinates on R 3 : 

x = r sin 9 cos ip y = r sin 9 sin (p z = r cos 9 

Following the same steps show that 

„ 2 l<9/ 2 <9\ 1 ( . „ d \ 1 d 2 
V 2 = — — r 2 — + - , — sin0- 



r 2 dr y dr J r 2 sin 9 89 ^ 89 J r 2 sin 2 9 dtp 2 
and 

// = /// drd9dipjdetgf(r,9,ip) 
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3.2 S 2 

Let us next consider something slightly less trivial that is curved: S 2 . To construct S 2 
of radius R we start in 1R 3 and impose the constraint 

x 2 + y 2 + z 3 = R 2 

We can solve this by taking 

x = R sin 9 cos (p y = R sin 9 sin ip z = R cos 9 

Problem: Show that the entire space is covered by taking 9 G [0, 7r] and tp G [0, 27r). 
Show that the coordinates break down at 9 = 0,ir. 

We start with the normal Euclidean metric on 1R 3 : 

ds 3 = dx 2 + dy 2 + dz 2 

We now substitute in 

dx = R cos 9 cos ipd9— sin 9 sin ipdip dy = R cos 6* sin ipd9+sm 9 cos = —R sin 

to find (note first that the cross d9dip terms cancel) 

ds 2 = R 2 (cos 2 9d9 2 + sin 2 6 dip 2 + sin 2 9d9 2 ) 

= R 2 (d9 2 + sin 2 6d<p 2 ) (3.6) 

This looks similar to polar coordinates on 1R 2 with r = 9 except that the metric coeffi- 
cient of dip 2 is sin 2 r and not r 2 . Thus in effect there are two origins: one at 9 = and 
the other at 9 = ir. Both of these are so-called coordinate singularities where ip is not 
well-defined. 

Another way to think of this is that at a fixed value of 9 there is a circle defined by 
the ip G [0, 2n) coordinate with radius .Rsin^. At 9 = this circle has shrunk to zero 
size. As we increase 9 the circle grows until it reaches at maximum size at 9 = n/2 and 
then it shrinks again until it disappears at 9 = n 

Let us see now why airplanes take the paths they do when flying. Suppose we consider 
a flight path defined by a curve 9 = f(ip). The infinitessimal length of a segment of this 
curve is 

ds = R\jf' 2 + sin 2 fd<p 

and hence the total length is 

l = Rj ^ f' 2 + sin 2 f dip 
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We wish to minimize this integral. To do this we can simply use the Euler-Lagrange 
equation which gives: 



d f f \ sin / cos / 



^ \Jf'i + sm 2 f) Jr + sin 2 f 



This is clearly a rather tough looking equation. However we can find at least one solution 
by setting / to a constant. This implies / = 0,7r/2,7r. The first and last case mean 
that there is no curve at all: the plane is stuck at the north or south pole (in fact the 
coordinates break down there and also the denominator vanishes so we should ignore 
it). The middle case implies that it circles the equator. 

In fact, with a little thought, this is enough. Since the sphere has an 5*0(3) symmetry 
we can always pick our coordinates so that Washington DC and London both lie on the 
'equator' defined by 6 = 7r/2. 4 We see that the shortest path is then a so-called 'great 
circle' between two points. That is a circle whose circumference is R. That is indeed 
the path that planes take (when air traffic control lets them and without volcanoes). 
We will return to this example later when we have some more experience. 



4 Manifolds and Tensors 

Warning: This is a physicists version of a deep and beautiful mathematical subject. 
You won't need to know more in the course but if you'd like to know more you can take 
Manifolds (CM437Z/CMMS18). No apologies will be made here for brutalizing this 
subject, I will just say a few words in passing before we simply go ahead and develop 
tensor calculus without knowing what we are really doing. 

4.1 Manifolds 

We have studied some elementary examples of differential geometry. However in the S 2 
case we thought of it as embedded inside IR 3 . It then inherited its geometry from the 
Euclidean metric of IR 3 . This is said to be extrinsic geometry. However one need not 
think this way. One can simply through away the embedding and think if S 2 as its own 
object with its own intrinsic geometry. 

An n-dimensional manifold is a space that locally looks like H n . 

Formally the definition involves taking an open cover of a topological space Ai, 
that is a set of pairs (C/j, fa) where C/j is an open set of M. and fa : [7$ — >■ !R n is a 
homeomorphsm onto its image, i.e. it is continuous, invertable (when restricted to its 
image in IR n ) and its inverse is continuous. These are subject to two key constraints: 

i)M = UiU t 

4 This of course is not the standard equator that is used in geography. It may seem politically 
motivated but we simply want to find the simplest mathematical solution. 
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ii) If UiHUj ^ then faofc 1 : fa^HUj) C IT ->■ ^(UiHUj) C IR n is differentiate 
(for our purpose we assume that all the partial derivatives exist to all orders). 

What does this mean? Each point p E M is contained in an open set Ui C M. called 
a neighbourhood. The map fa then provides coordinates for the point p and all the 
other points in that neighbourhood: 

fa(p) = {x\p),x 2 (p),x\p),...,x n {p)) e JR n (4.1) 

The key point of manifolds is that there can be many possible coordinate systems for the 
same point and its neighbourhood, corresponding to different choices of fa. Furthermore 
a particular coordinate system does not have to (and in general won't) cover the whole 
manifold. The second point guarantees that if two coordinate systems overlap then the 
transformation between one and the other is smooth. 

The classic example of a manifold is the surface of a sphere, such as the earth. 

Common coordinates are longitude and lattitude. However these don't cover the 
whole space as the north and south poles do not have a well defined longitude. 

We are in addition interested in Riemannian (or technically pseudo-Riemannian) 
manifold which means that we also have a metric g^ u this is an invertable matrix located 
at each point which determines lengths and angles of vector fields at that point, viz: 

\\V{x)\\ 2 = g, v {x)V»{x)V»{x) (4.2) 

Thus if we think of V^(x) as the infinitessimal variation of a curve V 1 = dx^ then 
we recover the definition above for the lenght of an infinitessimally small curve passing 
through the point x^ 

ds 2 = g^dx»dx v (4.3) 



4.2 Tensors 

We now need to develop the rules that the 'good' (covariant) objects will obey whenwe 
pass from one coordinate system to another. Such objects are called tensors. A tensor 
is 'something that transforms like a vector'. Indeed it is just a generalisaton of a vector 
field. You have probably already encountered these in a course on Electromagetism (F^ 
is a tensor) but in the special case that the transformation between coordinates is linear 
(i.e. Lorentz trnasformations). 

We saw earlier that for the notion of the proper distance to be invariant under 
coordinate transformations, i.e. diffeomorphisms, the metric had to transform as 

, dx^ dx u , . 

9ptr = dx>Pdx>° 9llu { } 

This is an example of a (0, 2)-tensor. We could also consider the inverse metric, that is 
the object g pa which is the matix inverse of g^ v : 

= S; (4.5) 
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Problem: Show that 

, dx' u dx' a x 
^ dx^ dx x ^ ^ ^ 

Thus we can define a (p, g)-tensor, or rank (p, q) tensor, on a manfold to be an object 
rpmfi2H3-^p with p upstair indices and q downstairs indices that transforms under 

a diffeomorphsm x^ — > x >lx (x v ) as 

V / UlU 2 U 3 ...U q 

(dx'^dx'^dx 1 ^ dx'^A f dx Xl dx X2 dx Xa dx x «\ plP2P3 ... Pp 
~ \dxPi dxP* Ox** "' dxPi ) \dx' Ul dx'^ dx' u ^ '"dx' u " ) ^ AiA 2 a 3 LaJ 

N.B.: Note the positions of the primed and unprimed coordinates! So the inverse metric 

is an example of a (2, 0)-tensor. 

A tensor field is simply a tensor which is defined at each point on the manifold. 

Thus a scalar is a (0, 0)-tensor; 

<f/(x') = <j>{x) (4.8) 

a vector is a (1, 0)-tensor; 



V"(x>) = g^V^x) (4.9) 



and a covector is a (0, l)-tensor; 



W = ^Mx) (4-10) 



In older books the upstairs and downstairs indices are referred to as contravariant and 
covariant respectively. 

Given two tensors we can obtain a new one is various ways. If they have the same 
rank then any linear combination of them is also a tensor. 

In addition a (p, g)-tensor can be multiplied by an (r, s)-tensor to produce a (p + 
r,q + s)-tensor. 

Finally a (p, g)-tensor j 1 (x) plP2P3 '" Pp AiA 2 a 3 ...a, w ^ n Pi 1 > 1 can be contracted to form 
a (p — 1, q — l)-tensor: 

T{xT«"» XMq = T(xr P2P3 - Pp ,a 2 a 3 ...a 9 (4.H) 

Clearly this can be done in pq ways depending on which pair of indices we sum over 

Also since we have a metric and its inverse g^ v we can lower and raise indices 
on a tensor (this doesn't really create a new tensor so it keeps the same symbol). For 
example if is a vector then 

V, = g^V v (4.12) 
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is a covector. 



Problem: What are the mistakes in the following equation: 



A» X B p °-34C^ = D pjT ° (4.13) 



We can also take the symmetric and anti-symmetric parts of a tensor. Consider a 
(0, <?)-tensor T^ 2 ^„^ q then we have 



^]/ilM2/!3-/ig] = ^ (^Mli"2M3---Mg _ ^M2jUlM3---/iq + •••) 

(4.14) 



where the sum is over all permutations of the indices and in the second line the plus 
(minus) sign occurs for even (odd) permutations. 

4.3 Covariant Derivatives 

Having introduced tensors we can consider their derivatives. However the partial deriva- 
tive of a tensor is not a tensor. To see this we can consider a vector field: 

9 = — 9 (—yp\ 



QjJv Q X W Q x \ ^ Q x p 

dx x dx' p d y p dx x fiPx'* 1 ^ p 
dx' v dxP dx x dx' u dx?dx x 

(4.15) 

The first term is fine but the second one isn't. Although we note that the derivative of 
a scalar is a vector, i.e. a (0, l)-tensor. To correct for this we must introduce the notion 
of a covariant derivative which respect the tensorial property. 

The solution to this is well-known in physics. We introduce a so-called connection 
which modifies the derivative into a so-called covariant derivative and transforms in such 
a way that the covariant derivative of a tensor is again a tensor. Thus we introduce T p p 
- called a connection - and define 

D u V' i = d v V lt + rZ p V p (4.16) 

We then require that under a diffeomorphism T p p transforms as 

/X dx p dx a dx' x ^ T dx a dx p d 2 x' x . 
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We then see that 



f) T X a i\x a a A f)2 ^ 

jjiy'P = UX UX —V p + V 

dx w dx p dx x dx w dxPdx x 



' dx x dx a dx!* p dx a dx p d 2 x' x \ dx'» 1 

Q X W Qrfu Q x p Act q %Iu Q x <rQ x p J Q x ir 



dx' p dx x D yp+ dx x d 2 x ,p yp _ §a 8x p 8 2 x' x v „ 



dx p dx' v dx' v dxPdx x w dx lv dx a dx p 

^^D X V> (4.19) 
dxP dx» K ' 



Problem: Show that we can also write 



,x dx p dx a dx' x dx' x d 2 x CJ 
Y' x = r T + 

in/ r\ ... r\ i. . r\ _ nrr 1 



dx'v dx' u dx T pa dx a dx w dx ,p 

(HINT: consider the chain rule on the second term of the original expression for T' x and 
note the identity <)\l ' \l = —M~ 1 dM for matrices) 

For covectors we define 

D X V, = d x V, - T{ p V p (4.20) 
This ensures that the scalar obtained by contracting V p and satisfies 

D X {V P U P ) = D X V^U, + VDxUp = X {V P U P ) (4.21) 

as we expect for scalars. 

The covariant derivative is then defined on a general (p, g)-tensor to be 

p. rpPlp2Pi—P V _ f) rpplp2P3—Pp 

P A A.-A-....A., — °P 1 AiA 2 A 3 ...A<, 

J_VPl rpUp2P3—pp , 

pu 1 AiA 2 A 3 ...A q "T ••• 

-pu rpPlp2P3---Pp _ (A QQ\ 

L pXi 1 u\ 2 \i...\ q ■■■ 

where each index gets contracted with T^ x 

Problem: Convince yourself that the covariant derivative of a (p, g)-tensor is a (p, q+1)- 
tensor. 

Note that the anti-symmetric part of a connection is a (l,2)-tensor (why?). 
This is called the torsion and it is usually set to zero. In addition the difference between 
any two connections is a (1, 2)-tensor. 

To determine the connection F x v we impose another condition, namely that the 
metric is covariantly constant, D x g pu = 0. This is called the Levi-Civita connection. It 
is the unique connection which annihilates the metric and is torsion free. 
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To determine it (and show that it is unique) we consider the following 

D\g^ v = dxg^u — ^ p Xlx g P v — T^g^ = 
D^g vX = dv9 v \ - T^pA - ^1\9u P = 
D u g Xfl = d u gx^ - K\9pp ~ Kp9\ P = 

(4.23) 

Next we take the sum of the 2nd and 3rd equation minus the 1st (and use the fact that 
r A = T A ): 

= d v g Xil + d^g uX - d x g^ - 2T^g pX (4.24) 
Thus we find that the Levi-Civita is 

r£„ = \g Xp {d,g pv + d v9m - d p9iXV ) (4.25) 

Thus we see that this is indeed symmetric in its lower two indices. It is easy to verify 
that 

£>aSV = dxgpv - T p Xtl g pv - T p Xu g^ p 

= d\gpv - ^{dpgxv + d x g pu - d u g Xp ) - ^{d v g XfJi + d x g pu - d^g^) 

= (4.26) 

The last thing to do is prove that T^ u indeed transforms as it should. To do this we 
note that, from (4.24), we have, by construction, 

= d' u g'^ + &X X - d' x9 'pu ~ 2% (4.27) 

in the new coordinates. Now we know that 

, dx T dx a , dx p d 

9X11 = dx 7 *^ 9 ™ V = dx'v dxP 

We just need to substitute this into (4.27) and determine T' x v . 

Problem: Show that Y x indeed transforms as a connection (HINT: first consider the 
case where the matrices dx^/dx' u are constant). 

Problem: Compute the components of T* v for S* 2 . 

Problem: Consider polar coordinates of IR 2 . Compute g tJ,u D IJt D v f for an arbitrary 
function f(r,9) and show that this agrees with V 2 / computed above. 
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4.4 Geodesies 



We wish to find paths which minimize their proper length, what we can think of as the 
analogues of straight lines. If X^(t) is a path in spacetime where r parameterizes the 
curve and runs from r = a to r = b we need to minimize the functional 



/= f ds= f ^\g^dx»dx v \ = f sj\ gixv X»X v \dT 

J a J a J a 

We do not worry about boundary terms, as we wish to find a local condition on the 
curve X^(t). This is simply a variational problem that you should have encountered in 
classical mechanics where the equations of motion are determined by extremizing the 



Lagrangian C = \j\g ilv X^X v \. 

This turns out to have a technically subtle because it is so non-linear. I don't want 
to bother you with the full details (except as a problem!) so a quick way to avoid this 
is note that we can just as well use the 'Lagrangian' C 2 and hence extremize 



"/ 2 " = J g v xX v X x dr 
The Euler-Lagrange equations give 



dr ^ ' dx^ 
Expanding things out a bit we find 

\ p 2 ox 2 ox 2 ox^ J 

= -2g^(x» + rlX x X») 

(4.29) 

This is equivalent to 

X 11 + T^ U X X X V = (4.30) 

A geodesic that satisfies (4.30) is said to be aflinely parameterized and in what follows 
we will always assume this to be the case. 

The technical point that we skipped is that one can find geodesies which don't 
satisfy this. If we compute the Euler-Lagrange equations that arise from minimizing / 

then there will be factors of \J g^ v X^X u in the denominator. We can ignore these if they 
are constant. And this is just what happens: 

Theorem: Along an aflinely parameterized geodesic g^X^X 11 is constant 



20 



Proof: We simply differentiate 

^(g^X'X") = d x g, v X x X»X» + 2g^X»X» 

= d x g, v X x X»X» - 2 9F iT;i A I" 

= (dxg^-d u g, x -d xgtlu + d,g Xu )X x X»X» 

= (4.31) 

Thus ds = const, x dr along and affinely parameterised geodesic and hence we can think 
of s and r as the same, up to an overall constant. 

In the original calculation we see that I does not depend on the choice of the param- 
eterization X^(r). This is because we can redefine the variable r — > t(t'). As such we 
see that 

dr' dX» , dr 



X» = 



dr dr' 



dr = - — dr' 
dr 



and hence 



{g^X^X^dr 



r-r-Hb) 
Jr-Ha) \ 
r-Hb) 
(a) \ 



dr' dX» dr' dX" 



-I 



dr dr' dr dr' 
dr' 



dr 
dr' 



dr' 



(4.32) 



dX» dX l 



dr' dr' 



where the factors of dr/dr' have canceled. The new curve, viewed as a function of t' 
won't satisfy (4.30). However one can show that there is always a choice of r so that 
the geodesic does satisfy (4.30). 

Problem: Show this. 

Note that this does not work in the expression for "Z 2 ". "Z 2 " only works if we use 
affine parameterization. 

Let us return to our sphere example again. Here we have 



9 2 + sin 2 6tp 2 dr 



I = R 



Before we imagined that 9 = f(f) which in this language means that we choose r — ip. 
This is possible since we have reparameterization invariance, however other choices, such 
as an affine parameter, might be better. Let us exam things in the new way using 



"/ 2 " = R' 2 I 6' 2 + sin 2 Q^dr 



(4.33) 



Since we don't have reparameterization invariance (we must use an affine parameter) 
we can't choose r how we like. We must proceed keeping r general. 
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This is just an Euler-Lagrange system for two fields 9 and ip. However let us note a 
simplification. The "Lagrangian" C = R 2 {9 2 + sin 2 9ip 2 ) depends on ip but not directly 
on <f (there is a symmetry <p — > <p> + const). This means that there is a conserved 
quantity: 

L = ^=R 2 sin 2 fly? (4.34) 
dip 

it's called L because it is related the angular momentum of the motion. The equation 
of motion now reduces to the equation of motion for 9: 

L 2 cos 9 

9-sin9cos9ip 2 = 9- — — = 
y R 4 sin 3 9 

Maybe this doesn't seem much better than what we had before. There is still the 
constant solution at 9 = tt/2 but now ip = Lt/R a . 

One trick is step back a bit and notice that this is the equation of motion for a 
particle with position 9 moving in a potential V = L 2 /R 4 sin 2 9. Indeed multiplying by 
9 and integrating once leads to 

1 T 2 

-ft 2 4- - F 

2 + 2R 4 sm 2 9 

where E is a constant (the total energy). Alternatively we can just substitute (4.34) 
into (4.33). So even if we can't solve the equation we can get a good picture of the 
behaviour of the solutions by thinking of a particle moving in the potential L 2 /R 4 sin 2 9. 

Actually one can solve this equation by substituting 9 = f(y) for some y so that 
9 = f'y. The equation can now be written as 

1 T 2 
2 J V + 2R 4 sm 2 f 

If we choose f = L/R 2 sin / then we get 

2ER 4 . 

y + 1 = — jy~ sm / 

Now the solution to sin //' = L/R 2 is — cos / = Ly/R 2 and hence sin 2 / = 1—L 2 y 2 /R 4 . 
In this way we find 

.9 „ ,-, o 2ER 4 
y 2 + 2Ey 2 = - 1 

Since the left hand side is positive definite we require that E > L/2R 2 and the solutions 
are then simply 

»=(g-^)cos(V2£(r-r )) 
Finally we put the pieces back together to find 



9 = arccos cos 9 cos \ (r — t ) 
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where cos^o = L/2ER 2 — R 2 /L. We can also compute (p by noting that 



V sin 2 6 R* i _ cos 2 Oo cos 2 (Jnf(r - To 



This can be integrated to give 



V = V?o + J 2 £ R6 esc 6» arccot ( sin 9 cot j J~^-( T ~ T o) 



4.5 Causal Curves Again 



g^ u dx^dx u \. Since g^ v is a sym- 



We have been a but cavalier with the expression 
metric matrix it has real eigenvalues. Furthermore in flat spacetime (in the usual coor- 
dinates) we have 
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-1 















1 
















1 





V 











1/ 



(4.35) 



Note that since g^ u must be invertable its eigenvalues can never pass through zero. Thus 
g^u always has one negative eigenvalue and three positive ones. The eigenvector associ- 
ated to the negative eigenvalue is 'time'. Thus vectors can have a length-squared which 
is positive, negative or zero. These are called space-like, time-like or null respectively. 
This is the same as in Special relativity. 

If X^{t) is a curve, i.e. a map from some interval in 1R to Ai, then we can construct 
the 'tangent' vector at a point X M (r ) along the curve to be 



dr 



T=T 



The length of this vector is determined by the metric 



g^(X(r Q )) 



dX» 



dr 



dX h 



= T o dr 



T=T 



(4.36) 



(4.37) 



An important consequence of this is that ||T|| 2 can be positive, negative or zero. 
Indeed for a static curve, where only the time coordinate is changing, ||T 2 || < 0. For 
light rays we have ||T|| 2 = 0. Finally curves for which every point is at the same 'time' 
have ||T|| 2 > 0. Similarly curves are call time-like, null and space-like respectively if 
their tangent vectors are everywhere time-like, null or spacelike repectively. 

The familiar statement of Special Relativity that nothing can travel faster than the 
speed of light is the statement that physical observes always follow time-like curves, in 
particular time-like geodesies, that is curves for which (at all points) 



g^X»X v < 



(4.38) 
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where the derivative is with respect to the coordinate along the particles world-line. 
Similarly light travels along null curves: 

g^X v = (4.39) 

at each point. 

Two events, i. e. two points in spacetime, are said to be causally related if there is 
a time-like or null curve that passes through them. In which case the earlier one (as 
defined by the coordinate of the worldline) can influence the later one. If no such curve 
exists then the two points are said to be spacelike seperated and an obeserver at one 
cannot know anything about the events at the other. 



4.6 Integration 

Let us look at how to compute the integral of some scalar quantity / over spacetime 
(or a part of spacetime). We need the integral to be well defined, i.e. independent of 
coordinate transformations. We first note that, under a diffeomorphism, x^ — > x'^(x) 



d^x = det j d 4 x 



(4.40) 



This transformation can be cancelled by noting that 



-det(^) 



\ 



/ dx x dxP 



\ det (£)\ det O^- det w 



= det(|^) y/-det(g') 



(4.41) 



Here we have neglected indices and viewed the various two-index expressions as matrices. 
Thus we have that 

dV^-dette') = d 4 x^/- det(g) (4.42) 

is a coordinate independent measure and so the a coordinate system invariant expression 
for the integral of / is 

Jf = J d 4 x^/-det(g)f (4.43) 



4.7 Riemann Normal Coordinates 

There is a local coordinate choice that we can use to set Y^ v = at any given point p. 
Note that this is not possible for a tensor which vanishes at p in all coordinates systems 
if it vanishes in one. From D\g^ v = this is equivalent to d^g^ = at p. If we suppose 
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that the coordinates of p correspond to x p = then this means that the metric has the 
form 

9^{x) = g^(0) + 0(x 2 ) 
To arrange this we start with a general metric 

9^{x) = g^(0) + G^x x + 0(x 2 ) 

and consider a coordinate transformation 

x >» = x » + C^ x x u x x + 0{x 2 ) 

Note that without loss of generality we have that C^ x is symmetric in u, A. Thus it has 
the same symmetries and index structure as T^ x so one may hope to use it to, locally 
at p, set T ,p x = 0. Indeed we compute 

dx» - v + vX dx»dx» ~ ^ 

We also need dx^/dx'" but all we need is the very lowest order in x p and this is simply 
obtained as the inverse to dx' p /dx u : 

Next we recall that 

.\ dxP dx a dx' x ^ T dx a dx p d 2 x' x 

r A = r r (4.44) 

^ dx'» dx'» dx T pa dx'» dx w dx°dxP v ; 

We want to show that if Y x v = T x u (0) + 0(x) we can chose C x v so that T' x u = + 0(x). 
This leads to the condition 

+ G(x) = r^(o) + G(x) - 2C X V + 0{x) 

and hence we simply take C x = \T X (0). 

Such a coordinate system is called the Riemann normal coordinate system about p. 
Note that if we consider geodesies that start at p then they take the simple form 

X»(t) = T^r + C(r 3 ) 

for a constant vector T M . Thus the simple flows along the coordinates are geodesies to 
lowest order in r 
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4.8 Curvature 

Partial derivatives commute: [d p , d v \ = d p d v — d u d p = 0. However this is not the case 
with covariant derivatives. Indeed 

[Dp, D U ]V\ = D^{d v V x -V p vX V p )-{n^v) 

= d,(d u V x - Y p vX V p ) - Y^(d a V x - K X V P ) 

-r; x (d u v a -r^v p )-^^u) 
= -dj* x v p + r° x K a v p -(jmov) 

= R»»x P V P (4.45) 

where 

R„x p = -W + - + r^r*, (4.46) 

is the Riemann curvature. 

N.B.: Different books have different conventions for R„ V \ P ■ 
For higher tensors one finds that 



fn 7") ]rpPlP2P3---Pp _ p 1lrpPlP2P-i---P P , 

\U^U V \1 A 1 A 2 A a ...A 9 — ^iivXx 1 7rA 2 A 3 ...A, + 



-Ft pirp^P2P3---Pp 



(4.47) 



R (ll/ x > is a tensor. To see this we note that [D p , D U ]V\ is a (0, 3)-tensor for any V\. 
Thus under a diffeomorphsim 

nT a ht t nT n 

^K-^gg-^r/V, (4.48) 

Now since 

dr' e 

V,= W V. (4-49) 

we see that 



dx°~ dx T dx w p dx' 6 
dx^dx^dx 1 ^ aTn ~dxP ' 
Since V p is arbitrary we deduce that R UT7T P is a (1, 3)-tensor; 



w Pv t _ ux ux ux 



p = ^dx^dx^dx^ e 
" uX dx'» dx'v dx' x dx e arn ; 



It has some identities: 

R {pv)\ = » R [pu\] = ' R ^p = Rx p^ ' D \r R ^]\ P = (4-52) 
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The first identity is obvious. 

To establish the other identities use Riemann Normal coordinates to find a system 
of coordinates such that = at some point p (but not everywhere). Thus at this 
point p 

R,J = ~d,Kx + d„r; x (4.53) 

and so 

R P v\ P — ~9prd^T uX + g pT d v T ^ x 

= ^ {-df.dxQpu - d„,d v g p x + d^d p g vX + d u d x g Pfl + d v d p g pX - d u d p g^ x ) 

= \ (-di*d\9pv + d p d p g vX + d u d x g pp - d u d p g pX ) (4.54) 

Here we can simply check that, at p, R\ pu \] P = and R^xp — R XpiiV = 0. Furthermore 
since these are tensors if they vanishes at p in one coordinate system then it vanishes 
in all. Finally there was nothing special about the point p. Therefore the identities 
everywhere. 

Problem: Prove the final identity D[ T R, X p = 0. 

From the Riemann tensor we can construct the Ricci tensor by contraction 

R»» = R pP u P (4-55) 

and it follows that R^ = R up _. And lastly there is the Ricci scalar which requires us to 
contract using the metric 

R = g^R^u (4.56) 



Problem: Show that 



D» [r pv - l -g, v R)j = (4.57) 



Problem: Compute R puX p ', R pu and R for S 2 . Recall that previously you should have 
computed 

T e w = -sm9 cos 9 = Y% = cot 9 (4.58) 

(HINT: The only independent component is Re^ ■) 
Problem: Show that in two dimensions one always has 

Rpu — ^9ixvR = 
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5 General Relativity 



We are finally in a position to write down Einstein's equation that determines the 
dynamics of the metric field g^ u and examine some physical consequences. 

Einstein's idea was that matter causes spacetime to become curved so that geodesies 
in the presence of large masses can explain the motion of bodies in a gravitational field. 
The next important step is to postulate an equation for the metric in the presence of 
matter (or energy since they are interchangable in relativity). In addition since gravity 
is universal the coupling of geometry to matter should only depend on the mass and 
energy present and not what kind of matter it is. 

Thus we need to look for an equation of the form 

Geometry = Matter (5.1) 

The bulk properties of matter are described by the energy- momentum tensor T^ v . Fur- 
thermore we want an equation that is second order in derivatives of the metric tensor 
(since we want to mimic a Newtonian style force law). Another hint comes from the fact 
that in flat space the energy-momentum tensor is conserved d^T^ = 0. Since this is 
not covariant we postulate that the general expression is D^T^ = Given the identity 

D» [r^ - \g»vR) = (5.2) 

an obvious choice is 

R^u - ~j9^ R + A SV = ( 5 - 3 ) 

where A and k are constants. We won't fix k in this course. For the curious it can be 
computed by comparing to the Newtonian Theory and one finds k 2 = StiGn/c 4 " where 
is Newton's constant. 

Here we have included an additional term which is obviously covariantly conserved: 
D^g^v = 0. There is a long story about A - the so-called cosmological constant. It is in 
fact the biggest mystery in the exact sciences. The problem is that we don't know why 
the cosmological constant is so small, by a factor of 1CT 120 from what one would expect. 
Furthermore recent observations strongly imply that it is just slightly greater than zero. 

Nowadays one usually absorbs the cosmological constant term into the energy mo- 
mentum tensor T MJ , — > T^ v + An~ 2 g lll/ where it is identified with the energy density of 
the vacuum. We will do the same. Thus we take Einstein's equation to be 

Rp, - \g» v R = k%u (5.4) 

It is often said that this equations says "matter tells space how to bend and space tells 
matter how to move" . It turns out that this guess is at least mathematically good: it is 
not overdetermined, i.e. as a set of differential equations it is well posed with a suitable 
set of initial conditions. This would not be the case if we hadn't chosen the left hand 
side to be covariantly conserved. 
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Problem: Show that Einstein's equation is equivalent to 



(5.5) 



6 Schwarzschild Solution 

Let us look for a time-independent and spherically symmetric solution to Einstein's 
equation with T^ u = 0. We take the anstaz: 

rfs 2 = _ e 2A(r) c 2 dt 2 + ^(r)^ + ^2 + sin 2 (fU) 

Note that we could have allowed for terms of the form g tr (r)dtdr. This would still 
be time-independent and spherically symmetric but it turns out we can ignore it and 
this will save us a lot of work. Technically having a time-independent metric is called 
stationary and a stationary metric which has no dtdx terms (for any spatial coordinate 
x) is called static. The difference between the two is like the difference between a lake 
and a river: both are constant in time yet a river is somehow moving and transferring 
momentum whereas the lake is truly static. 
So we have 
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(6.2) 

We start by calculating the non- vanishing Christoffel coefficients. There are two (equiv- 
alent) ways to do this. The first is to use the formula (4.25). The other is to consider the 
equations for geodesies using the action "Z 2 " introduced above, from the Euler-Lagrange 
equations one can read off the components of T^ u from the terms in the X x equation 
that are first order in derivatives. For the metric we have we find the "/ 2 " Lagrangian is 

£ = e 2A ^c 2 i 2 - e 2B ^r 2 - r 2 9 2 - r 2 sin 2 0<j? 

and thus the Euler-Lagange equations are (suitably normalized so that the coefficient 
of the second order term is one): 

i+2A'ri = 

f + B'ff-re- 2B 66-r sin 2 6e- 2B ipip + A'e 2< - A - B) c 2 fi = 

•■ 2 • 

6 + -r& — sin 9 cos 6(pip 



(p H — rip + cot 66(p 
r 



(6.3) 



This leads to (note the factors of 1/2 in when [i ^ v). 



rip 
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Y e = — sin 6 cos 6* 



F y — F y 
1 r6» — 1 0r 



1 



r 



r w = -rsin 2 ^ e - 2B 

r; r = b' 

T r tt = A'e 2(A - B ^c 2 



Kt = K = A (6.4) 



Next we calculate Rp V from 



Rjxv — Rp pv P — ^t^vp + dpFpu ^ pv^p\a + ^ pv^ap (6-5) 

The non-vanishing components are 

R w = ((rB' -r A' -l)e- 2B + l) sin 2 6 



R ee = (j-B' -rA' -l)e- 2B + 1 

R„. = —A" — A' 2 + A'B' + -B' 

r 

Rtt = (A" + A' 2 -A'B' + 2 -A'^e 2( * A - B ^c 2 (6.6) 

Next we see that R„. = and R tt = imply B' = —A'. Hence B = a — A for some 
constant a. In fact we can remove this constant a by rescaling t (which is equivalent to 
shifing A by a constant). This leaves us with two equations 

A" + 2A' 2 + -A' = 
r 

~{2rA' + l)e 2A + 1 = (6.7) 
We can solve the second one by writing it as 

r(e 2A )' + e 2A -l = (6.8) 
so if e 2A = 1 + X then rX' = —X hence we find 

e 2A = 1 - - (6.9) 

r 

for an arbitrary R. Lastly we must check that the second order equation is satisfed. 
Again to see this note that it can be written as 

(e 2A )" + -(e 2A )' = (6.10) 
r 
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which is just the harmonic equation for e in three dimensions. Thus our final answer 
is 

ds 2 = - (l - ^) c 2 dt 2 + + r 2 (d6 2 + sin 2 6d<p 2 ) (6.11) 

r 

This seems like a miracle since we solved two equations for a single function A. 
However the second order equation was guaranteed to be solved if the first order one 
is. This is a consequence of the Bianchi identity (why?) and ensures that the Einstein 
equations are well posed. 

The Schwarzschild metric is in fact it is unique, given our assumptions: 

Theorem: (Birkhoff) The Schwarzchild metric is the unique, up to diffeomorphisms, 
stationary and spherically symmetric solution to the vacuum Einstrien equations. 

We will not prove this theorem here, it simply follows from the Einstein equations. It 
is an elementary version of a 'no-hair' theorem. The surprising thing about it is that the 
solution contains no information about what makes up the matter. The result remains 
true if more complicated matter terms are added to the Lagrangian. 

It seems as if there is a problem at r = R and also at r = 0. Certainly we can't use 
the metric at r = R. If you calculate some curvature invariant, such as R^ uX p R ilvX 
then there is no apparent problem at r = 2R but there is a divergence at r = 0. For 
r < R we can still use the Schwarzchild solution. But note that the role of 'time' is 
played by r. In particular, for r < R, the metric is no longer time- independent. To 
understand more we must look at geodesies. We will do this later. 

For now it is important to note that we do not have to consider the Schwarzchild 
metric to be valid everywhere. For example it is also the unique solution outside a static 
spherically symmetry distribution of matter, whose total mass is M. In particular it 
describes the spacetime geometry outside a star such as the sun or a planet such as the 
earth (ignoring their rotation). 

Problem: Using the expressions for R^ u and setting B = —A but not imposing any 
condition on A, show that the components of G^ u = R^ — ^g^R are 

G w = r 2 sm 2 6e 2A (A" + 2A' 2 + 2 -A'^ 

G ee = r 2 e 2A (A" + 2A' 2 + 2 -A^ 

G rr = -e~ 2A (-A'e 2A + - 2 e 2A --\ 

G * = - c2e2A (l A ' e2A +^ 2A -^) ( 6 - 12 ) 

6.1 Geodesies 

Now we need to physically examine this solution. We start by looking at Geodesies, 
which, according to Einstein, are the paths of free-falling observers. As we have discussed 
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we can obtain these from the modify action 



S = -J drg^X»X v 



'r'-r'P-r'sm'etf (6.13) 



r 



If we think of this as a Lagrangian for four fields t(r), r(r), 9{r) and y?(r), then t and 9 
do not have and 'potential' terms. Thus it follows that 

E = c 2 (l-^y (6.14) 

and 

L = r 2 sin 2 ^ (6.15) 

are constant along any geodesic. 

We still have the 9 equation. We note here that 9 = ir /2 is guaranteed to be a 
solution to the Euler-Lagrange equations (why?). Thus we can simply take 9 = it/ 2. In 
fact one can argue that this is sufficient. Just as in the case of Newtonian physics in 
a spherically symmetric potential, the conservation of angular momentum implies that 
the motion takes place in a plane. This turns out to be true here too so we simply take 
that plane to be 9 = it/2. 

We have not identified what r is but from our discussion of affinely parameterized 
geodesic we saw that ds oc dr. The exact constant of proportionality doesn't matter so 
there are three distinct cases: ds 2 < 0, ds 2 = and ds 2 > corresponding to timelike, 
null or spacelike geodesies. In this case we find 

fds\ 2 ( R\~ l E 2 ( R\- 1 P L 2 . . 

where e = 1 for timelike, e = for null and e = — 1 for spacelike geodesies. A little 
rearranging leads us to 

This is the equation for a particle with position r in a potential 

V - .1' |M - ^ 





r / 

RL 2 

2r 3 



(6.18) 



The 1/r and 1/r 2 terms are the usual Newtonian potential if we take e = 1 for a timelike 
falling observer (the first e term is an irrelevant constant), but we see that there are 
relativistic corrections due to the additional 1/r 3 term. 
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6.2 Newtonian Limit 



Let us look at radial (L = 0) timelike (e = 1) geodesies at large r » R. This 
corresponds to a massive object falling straight down to earth or what ever the source 
of the curvature is. In this case the leading order equations are 



n • 1 o R _£/ 2 

E = c 2 t -r 2 - — = — - 1 
2 2r 2c 2 

Now for a slowly moving observer (slow compared to c) it is sensible to take ct = r, so 
that the curve is parameterized by time, or what an observer at infinity would call time. 
Note the factor of c. This is needed on dimensional grounds as we choose r to have the 
same dimensions as the proper length s and radius r (we did this when we set e = 1). 
Thus E = c. Differentiating the second equation we recover Newton's law: 

.. _ 1 d 2 r R 

r ~ ^ dt 2 ~ 

provided that we identify R = 2GnM/c 2 . 

With this in mind we have fixed the Schwarzschild metric to the more familiar form: 

ds 2 = - (l - 2Gn ^ ,c2 ^ c 2 dt 2 + - ^Im/c* + r \ dQ2 + sin2 ed ^ 2 ) (6-19) 

But in addition we have recovered Newton's gravitation law F = —G^Mmjr 2 as the 
leading order effect of the curvature of spacetime. 

6.3 Perihilion Shift of Mercury 

The higher order corrections to Newton's law that lead to two classic predictions of 
General Relativity which we now discuss in detail 

The first effect is that planets no longer move in elliptical orbits about the sun. 
This deviation is extremely small for most planets but it was already observed prior to 
Einstein that the closest planet to the sun, namely Mercury, does not follow an exactly 
elliptical path. Rather the 'ellipse' slowly rotates. This is called the perihelion shift of 
Mercury. 

To start with lets recover the elliptical motion in the Newtonian limit. Here we look 
for geodesies with L^O. Now it is more helpful to think of r as a function of ip. Thus 
we have 

rim rlr 

Lr'/r 2 



dr dip 

where r' = dr/dip. Let us introduce u = 1/r so that r = —Lu'. The equation for r 



becomes 

2" " c 2 " ' 2" " c 2 ' 2c 2 " 2 



h V _ + h V _ i^i (6 . 20) 
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At large orbits u is small so we can neglect the u 3 term. If we write u = G^Mjc 2 L 2 + u 
and drop the u 3 term then we find 

-L 2 u' 2 + -L 2 u 2 = — E 2 - - - 

2 2 2c 2 2 2 c 4 L 2 

To clean this up we write E 2 = c 2 + G 2 N M 2 /c 2 L 2 + G 2 N M 2 E 2 /c 2 L 2 and obtain 

~/ 2 , ~2 _ G 2 N M 2 E 2 

which has the solutions (putting back the original variables) 

u = — = (1 + E cos(v9 - <p )) (6.21) 

T C -Li 

This is the classic Kepler- Newtonian orbit with eccentricity Eq. It is an ellipse if Eq < 1 
(a circle if E — 0) , parabola if = 1 or a hyperbola if > 1 

Problem: Show that this indeed describes elliptical, parabolic or hyperbolic orbits 
depending on E . 

Next we need to consider the effect of including the u 3 term. Let us start by taking 
the derivative of (6.20) so that we find 



L 2 u" - G N M/c 2 + L 2 u - 3 " ; u = 

c 2 



G N ML 2 
Y 



Now we expand u = u + 5u where m is the solution we found in (6.21). Expanding 
this all out gives 



G N M 3G N ML 2 2 < t2x „ < t2x 6G n ML 2 :\C x .\IL- 

L u YL u Un + L ou +L ou u ou you) =0 

c 2 c 2 c 2 c 2 



The first three terms vanish since u is a solution of the Newtonian problem. We also 
drop the u 5u and (Su) 2 terms since we assume that both u and Su are small but also 
that 5u << Uq. Thus the leading order correction comes from Uq rather than -uo^o- 
The (Su) 2 term gives an even smaller correction. Thus we need to solve 



Su" + 5u = 



3G N M 2 



c 2 



~Un 



If we substitute in the exact solution for -u (setting ip = for simplicity) we find 

3G 3 M 3 ( E 2 \ 

Su" + 5u= ( 1 + 2E cos V + ^(cos(2v?) + 1) j 

The solution to this is 

3G 3 M 3 ( 1 1 \ 
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Ugh. However the interesting thing is the ip sin ip term that grows without bound. This 
arises because the cosy? term is acting as a source for the harmonic oscillator but at 
the resonant frequency. Thus ultimately this term will grow large and invalidate the 
assumption that 5u is a small perturbation. Certainly it will be the dominant correction. 
From the point of view of the full solution nothing too dramatic can happen. The 
interpretation of this term is to observe that 

G N ME (( QG 2 N M 2 \ \ G N ME 3G 3 N M 3 E . 

cos [y 1 - )v) = cosv? + <*» ipsmip+ ■ ■ ■ 

Thus this term can be interpreted as changing the periodicity of the solution. In par- 
ticular we find 

1 G N M ( (f 3G 2 N M 2 

where the ellipsis denotes smaller terms. 

This gives the perihilion shift of Mercury (it's too small for other planets to obeserve). 
We see that the minimum distance of Mercury to the sun occurs at 

// 3G 2 N M 2 \ \ 1 

We have normalized things so that at ip — Mercury is at its closest point to the sun. 
But the next time this happens is 

Thus the orbit is 'rotating' forwards. 




6.4 Bending of Light 

The second prediction of General Relativity beyond the Newtonian theory is that a light 
ray passing close by the sun will bend. It is not clear in the Newtonian theory how to 
calculate the bending of light since it has no mass. You might try to interpret it as a 
particle and assign some effective mass for it due to its energy, i. e. by including effects 
of Special Relativity. However the amount predicted is only half that observed. But in 
General Relativity it is clear what to do: we look for geodesies with e = but L ^ 0. 
So we just follow the steps that we did for the perihilion shift of Mercury. The equation 
is now 

2' 2r 2 c 2 r 3 ' " 2c 2J 



1. 2 L 2 G N ML 2 1 ,,, 

o^ 2 + ^ " -^r- = ^ e2 ( 6 -22) 



which in terms of u is 



\LV + \LW - °^u> = ^ (6.23) 
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Again we start by neglecting the u 3 term. This time we take for our solution (let us 
set, with out loss of generality, (po = 0) 

IE . 
u = - = — sin (p (6.24) 
r cL 

This represents a light ray that comes in from infinity at ip — and then out again to 
infinity at ip — tt. In particular if we write x = r cos (p and y = r sin ip then the solution 
is just 

cL 
V= ~E 

Thus the light ray is undeflected and its closest distance to the run is r min = cL/E. 

Let us consider the effect of the u 3 term that comes from General Relativity. Again 
writing u = Uo + 5u, where uo is the solution (6.24), and differentiating we obtain the 
equation 

L\4 + 8u") + L\u + Su) - 3GN f L \ u + 8uf = 

Once again we only need the term that is linear in 5u (again the terms linear in uq 
vanish since u is a solution). This gives 

3GmM 



„ „^ N 1V1 2 
OU + OU = r U + . . . 



c 



Substituting in for uq we find 



so the solution is 



r II r 3GnME 2 

5u" + 5u= 2g4L2 (1 - cos(2v9)) 



. 3G N ME 2 G N ME 2 

5m = ^Z^ + ^Z^ cos(2 ^ 



or 



1 E 3G N ME 2 G N ME 2 . , 

r = ^ Smv?+ ^Z^ + ^Z^ C ° S(2V?) 



E 3G N ME 2 G N ME 2 , 9 



Next we need to study this solution in the asymptotic regions where r — > oo. This 
gives a quadratic equation for sin ip: 

sinV-^^sinv,-2 = 

The solutions are 



SinV = 2G^ ± Ug{we 2+% 

c 3 L ( 18G 2 M 2 E 2 \ , x 

1±1±-^-— + ... (6.26) 



2G N ME V 2 c 6 L 2 
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Now the effect needs to be small as G^M — > and thus only the lower sign solution 
makes sense (i.e. has | sm<p\ < 1): 

2G N ME 

sin (p = — — 

c 6 L 

Expanding for small ip we find the ray comes in at 

2G N ME 

<Pi = 57 — + • • • 

c 6 L 

and leaves at 

2G N ME 
p 2 = ,+ —^- + ... 

Corresponding to a deflection angle of 

AG N ME 

Ap = — + . . . 

c 6 L 

Note that E an be related to the smallest value of r that the light ray takes. In the 
unperturbed solution we see that the minimum value of r is r min = cL/E. To leading 
order this is unchanged in the perturbed solution so that 

Ap = — 2 + . . . 

C T' m i n 

6.5 Gravitational Redshift and Time Dilation 

There are other important effects of General Relativity. Just as in Special Relativity 
spacetime has the 'ability' to cause time dilation and Doppler-like red-shifts. Let us 
look at a two observers. For simplicity we assume that they are both are at a constant 
value of r. Each observer sits at a constant values of r, 6 and ip. To them time is 'proper 
time' in the sense that (As) 2 = — c 2 At 2 and hence 



/ 2G N M/c 2 . 
AU = Wl — '—At 



for i = 1, 2. This leads to 

At x At 2 



2G N M/c 2 H 2GnM/c t 

n V f'2 



More generally we have (as long as g 00 is time-independent) 

Ah _ gooixt) 
At 2 \ g o(x 2 ) 
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A standard notion of a clock is given by the (inverse) frequency of light: At = \ jv. 
Suppose that an observer at r = r\ emits a light ray that escapes to some larger value 
of ri> r\. If the frequency of emitted light is v\ this corresponds to taking At\ = \jv\. 
However at infinity an observer will measure: 



V2 



1 - 



2G N M/c 2 
ri 

2G N M/c 2 1/1 
■r-2 



Thus i>2 < v\. This is known as a red-sift since light will sign as more red (lower 
frequency) at infinity than it is as seen from where it was emitted. In the limit that 
r 2 — > oo we get 



2G N M/c 2 



n 



and in particular < v 1 if 2GjyM/c 2 < r\ < oo. Thus the light seen at infinity has a 
lower frequency, ie it has been red-shifted. 



6.6 Schwarzchild Solution as a Black Hole 

So far we have been treating the Schwarzschild solution at distances r > 2G^M/c 2 . 
However it is most famous for it's behaviour at r < 2G N M/c 2 . Let us look at radial 
geodesies, these correspond to in-falling observers traveling at constant values of 9 and 
<f. Recall that our effective dynamical system is 

1 . 2 1 G N Me 1 2 

2 r + 2 e - = 2^ E (6 ' 27) 

There is nothing special from this equation about r = 2GnM/c 2 . Indeed only r = 
seems singular (there is an infinite potential there). It follows from general considerations 
that one will pass through r = 2G^Mjc 2 in a finite afline time (that is a finite value of 
r). In addition one will hit r = too. However these geodesies are parameterized by 
the proper time r, which is the time that the in-falling observer feels. Let us consider 
what an observer at a safe distance from r = 2G^Mjc 2 sees. 

Thus we want to consider dr/dt as opposed to dr/dr. We simply note that 

dr 
dr 




1 - 



2G N M/c 2 \ 1 dr 
r j ~dt 



(6.28) 



From here we see that the geodesic equation is 




2 / ^/, 2 \2 



= ii- 2C ' M n >-yi- ic ^ /c 
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Near r — » oo this modification does not do much as t ~ Er. However as we approach 
r = 2GnM/c 2 we see that 

J_ (dSrV 1 / _ 2G N M/c 2 V 
2c 2 \dt ) 2 \ 2G N M/c 2 + 5r) 

1 ^ ;(5rf + ... 



2AG 2 N M 2 /c 4 

where r = 2G N M/c 2 + 5r. Thus we see that 

d5r 5r 



dt 2G N M/c 2 
and hence, near r = 2GnM/c 2 we have 



(6.30) 



+ ... (6.31) 



5r = e ^ n m/^ ( 6>32 ) 

This shows that r never reaches r = 2G^Mj(? for any finite value of t. Thus an observer 
at infinity, for whom t is the time variable, will never see an in-falling observer reach 
r = 2GjvM/c 2 . Whereas we saw that the in-falling observer will pass smoothly through 
r = 2GnM/c 2 in a finite proper time. 

This is gravitational red-shifting gone mad. If we fix one observer at infinity, r\ = oo 
then our red-shifting formula states that, for every one second in our frame the in-falling 
observe has 

1 

as T2 — > 2GnM/c 2 . Thus to an observer at infinity the clocks on the in-falling observe 
slow down to zero as they get closer and closer to r = 2G^M/c 2 . They appear to 
become frozen and live for ever. 

Thus to an outside observer the region r < 2GnM/c 2 is causally disconnected, they 
cannot send in any probe, say a light beam or an astronaut, which will be able to go into 
this region and return. The surface r = 2GnM/c 2 is called the event horizon because 
observers outside the horizon will never be able to probe what is beyond r = 2G N M/c 2 , 
whereas a freely falling observer will smoothly pass though in a finite time. Of course it 
also can be shown that, as is well known, no signal inside r = 2GfqMjc 2 can reach the 
outside. This is the classic example of a black hole. 

Clearly we need to understand the spacetime near r = 2G^Mj c 2 better. In particular 
the coordinates are breaking down but this does not mean that spacetime is breaking 
down. This could be, and in fact is, just a coordinate singularity. 

To find a new set of coordinates let us consider a coordinate that is well-suited to 
the in-falling observer. In particular consider such an observer who is lightlike, so that 
ds 2 = 

2G N M/c 2 \ 2 2 dr 2 



= - 1 — — c A dt z + 



2G N M/c 2 



1 - 

r 
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This gives 



±cdt = o = dr H — ^- — ° . „ dr 

^ 2G N M/<? y r-2G N M/c 2 



We can integrate both sides of this to find 



2G N M , / c 2 



±cr = r H — in 



c 2 V 2G ^v M / 

Here we see that t — > oo as r — > 2GnM/c 2 , just as we argued above for the timelike 
case. Therefore we can choose to use the new coordinate 

2G N M , / c 2 A 
u = ct + r H — In 



c 2 V 2G ^v M 

so that rf-u = along an in-falling null geodesic. 
By construction we have 

dr 

du = Cdt + - 2G N M/^ 
r 

so we can rewrite the Schwarzschild metric as 

2 



2G N M/c 2 \ dr \ dr 




2 



dg2 = _ I i _ \ \du-- Z, MI , 1 +- Z, Mlr z + r 2 {d6 2 + sin 2 6d V 2 " 



2G N M/c 2 I 1 ^ _ 2G N M/c 2 
r I r 

= -{l- 2GN ^ /C2 ^j du 2 + 2dudr + r 2 (d9 2 + sin 2 6d V 2 ) (6.33) 

These are called in-falling Eddington-Finkelstein coordinates. Now look: there is noth- 
ing singular about r = 2G^Mjc 2 . While it is true that g uu = there the metric is still 
invertible due to the drdu term. However r = is singular. Indeed one can show that 
R^\ p R^ X P diverges at r = 0. 

Problem: Compute the inverse metric and show it exists at r = 2G^M jc 2 . 

Problem: Show that it is impossible to find a coordinate transformation that makes 
R^u\ p R^ X p well behaved at r = 0. 

We can get an intuitive notion for what is happening by observing that the original 
coordinates r,t are valid at r > 2G^Mjc 2 and also at r < 2G^M 2 jc 2 . The first region 
we have studied a length in the previous sections. Consider the second region. Here the 
metric is better written as 

ds 2 = - 2G Jf c2 + ( 2 ^MII _ A cH t 2 - +r 2 (d9 2 + sin 2 6d^ 2 ) 
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so that here r is time and t is space. The metric is now wildly time dependent. Thus the 
roles of space and time have interchanged. As we have seen they have done so smoothly. 
Looking again at the Eddington-Finkelstein coordinates we see that at large r we have 

ds 2 = -du 2 + 2dudr + r 2 (d9 2 + sin 2 6d</) 

If we write r — (v + u)/2 then we have 

ds 2 = dudv + r 2 (d6 2 + sin 2 6d^ 2 ) (6.34) 

These are the normal coordinates for Minkowski space where u, v describe the lightcone 
about t — r — 0. That is the light cone consists of the points u = constant or v = 
constant. 

Problem: Show that in flat space the with u = ct + r, v = —ct + r the metric is (6.34) 
and that the light cone through t = r = is parameterized by u = or v = 0. 

As we fall into r = IGjqMjc 2 the light cone defined by u,v or r,t 'tips'. Eventually 
we reach r = IG^Mjc 2 where the Eddington-Finkelstein metric is 

ds 2 = 2dudr + r 2 (d6 2 + sin 2 6d^ 2 ) 

Thus the light cones have tipped so much that u and r are both lightlike (u is lightlike 
everywhere). Once in the region r < 2G^Mjc 2 we again have lightcones but they have 
tipped so much that the singularity at r = (which should now be thought of as a 
time in the future) now lies inside the light cone of an in-falling observer. This means 
that they are doomed to hit r = 0. It has become an event in their future, that exists 
throughout space, and it therefore unavoidable. 

The surface defined by r = 2G^Mjc 2 is called an event horizon and the radius 
2GnM/c 2 is called the Schwarzschild radius. We have seen that lightrays and observes 
who fall in through r = 2G^M/c 2 cannot escape back to r > 2G^M/c 2 and in fact are 
doomed to hit r = where there is a singularity. This is known as a Black Hole. 

Does this mean that there is a Black Hole at the centre of our sun? No. The point is 
that the Schwarzschild metric is the solution in the absence of matter (i.e. with T^ u = 0). 
A star, like our sun, has matter in the region r < R s where R s is the radius of the sun. 
The Schwarzschild solution is then valid for r > R s . 

A different, non-singular, solution is required for r < R s corresponding to the fact 
that Too 7^ there. However if there is a body that is so dense that its radius is less 
than it's Schwarzchild radius then it will be a black hole as the Schwarzschild solution 
is valid for values of r that include 2GnM/c 2 . 

For example the sun has R s = 7 x 10 7 m and 2G^Mjc 2 = 3 x 10 3 m. However there 
is a black hole at the centre of our (and most) galaxies. 
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7 Cosmology: FRW 



The idea now is to model the cosmic history of our universe. In particular we assume 
that the spacetime has spacelike hyper-surfaces with the maximum symmetry. Therefore 
the 3-dimensional spatial cross sections are either hyperbolic space H 3 , flat space R 3 or 
spheres S 3 . Let us endow this space with local coordinates x % and metric 7^. Without 
loss of generality we can take the four-dimensional metric to be of the form 



Thus the only parameter is the scale factor a(t) which gives the physical size of the 
spatial hyp ersurf aces. 

We also consider a non-vanishing energy-momentum tensor. To be consistent with 
the symmetries we assume that 



i.e. we assume that the only non- vanishing components consist of an energy density 
p as well as an isotropic pressure p; L . Note that we do not assume that p and p are 
constant. Thus the general assumption of the model is that, on a large cosmological 
scale, the universe is isotropic and homegeneous. The former means that there is no 
preferred direction whereas the latter means that there are no preferred points. These 
seem like very reasonable assumptions. Certainly as far as we can tell, on the largest 
scales that we can observe, the universe consists of an even but sparse distribution of 
galaxies and hence can be thought of as homogeneous. As far as we can tell the universe 
is also isotropic however since we can only look out from where we are it could be the 
universe is not isotropic, so that it has some kind of centre, in which case we must be 
relatively near the centre. However such an earth-centric view has been out of fashion 
in cosmology for hundreds of years, i. e. since Copernicus. 

This means that we choose the metric 7^ to be one of three possibilities: R 3 (a 3D 
plane), S 3 (a 3D sphere) or H 3 (3D hyperbolic space). These are the three manifolds 
that are isotropic and homogenous, meaning that there are no preferred points and no 
preferred directions. Since we can absorb a constant scale factor these metric take the 
form 



ds 2 = -c 2 dt 2 + a 2 (t)'ji j dx i dx j 



(7.1) 




(7.2) 



dsl = -f ij dx i dx j = — + r 2 (d6 2 + sin 2 9dip 2 ) 



With k = —1, 0, 1. Clearly for k = we have R 3 in spherical coordinates: 



dsl(k = 0) = dr 2 + r 2 dB 2 + r 2 sin 2 6d<p 2 



For k = 1 we write r = sin ip and find 



ds 2 3 (k = 1) = di? + sin 2 ^dO 2 + sin 2 sin 2 9dip 2 
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which is just the metric on S* 3 . For k = — 1 we write r = sinhp in which case we have 

dsl(k = -1) = dp 2 + sinh 2 pd6 2 + sinh 2 p sin 2 6dip 2 

This is the metric on hyperbolic space. Unfortunately we don't have time to go into the 
details here. In all these cases one finds the the Ricci tensor constructed from 7^ 
satisfies 

7*3 j = 2k^ij 

Experiments indicate that, in our universe, k = so if you'd like you can simply take 
this case and use 7^ = 5ij in Cartesian coordinates. 
We need to solve the Einstein equation 

Rfiv ~^9)ivR ft Tfxv 

To begin with we must calculate the Levi-Civita connection coefficients 

T % = \{~^) ^9tj + d j9u - dt9ij) = ^ad-fij 
Hi = l Al l \d t g l3 +d m -d igt] ) = -5) 

Zi (X (X 

rj fc = l^ 2 l U (d J g lk + d k g lj -d l g jk )=Y jk 



(7.3) 



where 7* fc are the Levi-Civita connection coefficients of the spatial metric 7^. From 
these we can calculate the Ricci tensor 

R»v = R^ x = -d^vl x + d x T* v - r^ A r* CT + r^,r* A (7.4) 

We find the non-zero components are 

Rtt = -d t (3-)-3- 2 
a a 1 

= -3^ 
a 

1 2 3 

Rij = nj + —dtiaa-fij) - ^a 2 %j + ^a 2 ^ 

1 2 

= rij + -^ad'jij + ^a 2 7ij (7.5) 

(7.6) 

where r\j is the Ricci tensor for the spatial manifold. Since it is maximally symmetric 
we have that 

m = 2k^j (7.7) 
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for some k — 0, —1, 1. Continuing we see that the Ricci scalar is 

R = -c^Ru + a-^Rij 



6 a „ k 6 a 2 ,_ oX 

+ 6 ^2 +^2^2 ( 7 - 8 ) 



cf a cr cf a 
Putting these together we find that the Einstein equation is 

Cp 1 Qp JvC^ 

Rtt + ^rR = 3— + 3 — 

2 a 2 a 2 

= K 2 T tt 

1 2 1 

R ij ~ n a2 lij R = ( 2 aii 2^ ~ 



^ C C 

/t 2 T 4J (7.9) 



Thus we find the equations 



d 2 kc 2 2 
3— + 3— = k p 
a 2 a 2 

2- + - 2 + % = -k 2 c 2 P (7.10) 
a a 2 a 2 

The first equation is known as the Friedman equation. Note that it is first order in 
time derivatives. This is a consequence of the Bianchi identity. Quite often the second 
equation is rewritten, using the Friedman equation, so that 

a 2 kc 2 1 



r 2 

a 2 a 2 3 



2 + ^2 = o^P 



I = -^ 2 (3pc 2 + p) (7.11) 

One famous feature of these equations is that we can only find a static universe if 
p = p = (in which case we have Minkowski space). Thus the universe is evolving. 
Indeed not long after General Relativity was invented Hubble observed the expansion 
of the universe (which at that time was not a prediction of General Relativity). 

Often one introduces the Hubble 'constant' H = a/a however, except for the case 
of exponential expansion, H is not constant (although its time variation is over cosmic 
scales). 

Note that at late times, meaning large a, one can drop the k term from the equations. 

In addition the matter 'equation of state' is often taken to be p = wp/c 2 where w is 
a constant. For any known type of matter one has that w > — 1 and so this is generally 
assumed to be the case. In this case one can solve the equations. Let us assume for 
simplicity that k — and try 

a = a l t 1 (7.12) 
for some constant 7. From the Friedman equation we see that 

l -K 2 p = 1 2 r 2 (7.13) 
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and hence substitution into the remaining equation gives 

7(7 -i)r 2 = -I K 2 (3 W + i) P 

3w + 1 2i o 

= ^— 7 2 t (7.14) 

This implies that 

— 2 — 7 =7 (7.15) 

and hence we find 

a(t) = ai t^ (7.16) 
where ai = a(l). In the limit that w = — 1 the solution becomes exponential: 

a = aiev 3 

This spacetime is known as de Sitter space. 

Notice that in the w > — 1 case there is a singularity at some early time where a = 0. 
This is a genuine curvature singularity. Thus the present observation of the expansion 
of the universe points to in initial 'Big Bang' singularity. Note that the Universe isn't 
expanding into anything and indeed for k 7^ 1 the 'size' of the spatial cross sections is 
infinite. 

Problem: Find a change of variables that maps the metric 

ds 2 = —c 2 dt 2 + a 2 {t)^ijdx l dx^ 

to so-called conformal time: 

ds 2 = a 2 (r])(—dr] 2 + r ) i jdx l dx^) 

What is 77 exactly for the case that a(t) = ait 1 ! 

Another concept that arises is that of a particle horizon. Consider a massless particle 
moving such that, along E, r y i jdx l dx^ = dr 2 for some variable r. It follows a null geodesic 

= - c 2 dt 2 + a 2 dr 2 (7.17) 

Thus in the time after t = t the particle will travel a distance 

/■* dt' 

R = c / (7-18) 

Jto a{t>) y ' 

The point is that this can be bounded. For example if the Hubble parameter H = a/a 
is constant, i.e. a = aoe Ht where ao = a(0), then 

R = -ca^H^e^X = ca^H' 1 ^- 11 * - e~ Ht ) (7.19) 
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Thus if we send the particle out today so that t = 0, even if we wait until t — > oo we 
see that R will be bounded, i. e. the particle can only make it out a finite distance due 
to the expansion of the universe. 

Another point to notice is that at different periods in the expansion different fields 
may have dominated, with different values of w, thereby leading to different expansion 
rates (but always positive). In particular initially the Universe was hot and the matter 
in the form of atoms had not yet formed and hence the main contribution was largely 
photons and radiation. The energy momentum tensor of the electromagnetic field is 
traceless g^T^ = and hence w — 1/3. In this 'radiation' era the Universe expanded 
as a oc t 1 / 2 . 

Problem: Show that g^T^ = and w — 1/3 for the electromagnetic field. (HINT: 
you may recall that T^ u = F^ X F U X - \g^F 2 ). 

After the Universe cooled down enough matter did form and the dominant source 
of the energy momentum tensor was from a pressureless gas of matter. Here w = and 

2 

hence a ~ ts. This is known as the matter dominated era. 

We now think there there are two other era's in the universe's expansion. Both of 
these correspond to w — — 1 (i.e. p = — p) and exponential expansion. In this case we 
see that 



This type of energy momentum tensor is caused by a non-vanishing, vacuum energy 
p. Such a contribution to Einstein's equation is called a cosmological constant, it 
corresponds to adding in a term to the Einstein equation 



However this is now normal viewed as coming from a vacuum energy term Vg^ v in T^ u . 

What are the two era's? One is now. Observation now strongly supports the notion 
that we are undergoing small but exponential expansion. The associated cosmological 
constant A is small but, in time, the exponential growth in a has come to dominate. 

The other era was in the early universe, before the radiation dominated era. This is 
called inflation and the cosmological constant or vacuum energy in that case was large. 
This era is still somewhat speculative but it has become the dominant idea in cosmology 
and has begun to pass observational tests. However the original motivation for inflation 
was to solve some problems with the standard expansion scenario. In particular there 
are two key problems (there are others): 

• Why is the universe in thermal equilibrium across the whole sky? We have measure 
the temperature of space and it is the same everywhere up to fluctuations of the 
order of 10 -5 . Under the power-law expansions of the matter and radiation era's 
different parts of the sky have not been in causal contact with each other. So it is 
rather striking that their temperature is the same to with a factor of 10~ 5 . 





-g^R + Ag, 
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• Why is the universe so close to k — 0. The universe that we observe today is very 
flat. Although expansion clearly flatens space (the effect of k drops as a~ 2 ) the 
power law expansions of the matter and radiation era's are not enough to account 
for this unless we assume that the initial universe was very very flat. 

To solve these problems one postulates a period of vast expansion. In particular ere 
one requires a to have grown by a factor of e 60 , i.e. 

a(after) = e 60 a(before) 

So the most natural thing to imagine is the exponential expansion associated to a cos- 
mological constant.. The numbere 60 is referred to as 60 e-foldings. Therefore the part 
of the universe that we see all came from an extremely small, quantum-sized, portion 
of the universe immediately after the big bang. Indeed the temperature fluctuations 
that we see, and which led to structure formation, are just quantum fluctuations of the 
initial space. 



Appendix: Comparison with Gerard Watt's Course 

In recent years this course has been given by Dr Watts. Although the content of his 
course and this one are essentially the same there are some conventions and notations 
that differ. Thus in order to read his notes and practice his exams perhaps it is helpful to 
list here some of the differences between his course and this one. With these comments 
in mind you should be able to translate all of the questions in his exams into a form 
that you can try to solve. 

• He uses the (+,—,—,—) signature for the metric, i.e. Minkowski space has the 
metric ds 2 = c 2 dt 2 - dx 2 - dy 2 - dz 2 . 

• He discusses the Newtonian force law more generally {i.e. not just about a spher- 
ically symmetric source): 

Fi = —mdiip V 2 y? — 4:TtGnp(x) 

where p(x) is the mass distribution. The F = —G^Mm/r 2 law that we used 
simply corresponds to a point source at the origin: p = M5(x). 



He calls a (p, q) tensor a 



p 
Q 



tensor 



When discussing geodesies he introduces the notion of covariant derivative along 
the curve: 

x^u* = x^u x + r* eh 
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dX^/dr is the tangent to the curve 



DU X 




Dt 



because U x = X^dJJ 



by the chain rule. 



A vector U x is said to be parallel transported along a curve A M (r) if DU x /Dr = 0. 
An affine geodesic is a curve whose tangent vector is parallel transported along 
itself: DX x /Dt = 0. 

• In cosmology he writes out the cosmological constant term explicitly so that 



whereas we absorb A into p and p and only consider the second term (note that 
—c~ 2 p = p = A in (7.2) is just a cosmological constant ). 
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